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American  National  Standard 

Code  Extension  Techniques 

for  Use  with  the  7-Bit  Coded  Character  Set 

of  American  National  Standard  Code 

for  Information  Interchange 


1.  Introduction 

In  the  establishment  of  American  National  Standard 
Code  for  Information  Interchange  (ASCII),  X3. 4-1968, 
a  fundamental  decision  had  to  be  made  as  to  the  size  of 
the  code.  In  making  such  a  decision  there  is  usually  a 
conscious  effort  to  avoid  the  most  obvious  problems 
with  a  code  that  is  either  too  large  or  too  small.  Should 
the  number  of  characters  included  be  too  small,  many 
users  will  find  their  needs  not  accommodated  and  will 
be  forced  to  adopt  “parochial”  codes  for  their  applica¬ 
tions.  Should  the  number  of  characters  be  too  large, 
many  users  will  find  the  code  disproportionately  costly 
to  implement,  or  untenably  inefficient  in  transmission 
or  storage,  and  will  again  be  driven  to  the  use  of  some 
other  code.  Thus,  either  extreme  in  code  sizing  will  re¬ 
duce  the  generality  of  the  code,  defeating  the  very  pur¬ 
pose  of  standardization  in  this  field. 

The  7-bit  size  ( 1 28  characters)  adopted  for  ASCII  is 
thought  to  be  near  optimum  at  present  with  respect  to 
the  foregoing  considerations.  Nevertheless,  there  will  be 
numerous  applications  with  requirements  that  are  not 
accommodated  by  a  code  of  this  size,  or  at  least  not  by 
the  specific  characters  assigned  within  it.  Many  of  these 
applications  can  be  served  by  the  use  of  the  standard 
code  augmented  by  standard  code  extension  procedures. 
Through  such  an  approach,  the  user  may  be  able  to  im¬ 
plement  much  of  his  system  with  standard  hardware  and 
software.  More  significantly,  he  will  thereby  be  able  to 
retain  compatibility  with  other  systems  for  the  inter¬ 
change  of  that  information  which  can  adequately  be 
directly  represented  by  the  standard  code. 

2.  General 
2.1  Scope 

2.1.1  This  standard  specifies  methods  of  extending 
the  7-bit  code  remaining  in  a  7-bit  environment  or  in¬ 


creasing  to  an  8-bit  environment.  The  description  of 
techniques  is  contained  in  four  interrelated  sections 
dealing  respectively  with: 

( 1 )  The  extension  of  the  7-bit  code  remaining  in  a 

7- bit  environment 

(2)  The  structure  of  a  family  of  8-bit  codes 

(3)  The  extension  of  an  8-bit  code  remaining  in  an 

8- bit  environment 

(4)  The  relationship  between  the  7-bit  code  and  an 
8-bit  code 

2.1.2  While  ASCII  is  the  agreed-upon  code  for  infor¬ 
mation  interchange,  an  8-bit  code  as  described  in  this 
standard  is  provided  for  information  interchange  within 
an  8-bit  environment. 

2.1.3  It  is  not  the  intention  of  this  standard  that  all 
instances  of  its  application  accommodate  all  of  its  pro¬ 
visions.  However,  it  is  intended  that,  when  code  exten¬ 
sion  techniques  are  used,  the  applicable  parts  of  this 
standard  are  to  be  followed. 

When  two  systems  with  different  implementations 
of  code  extension  techniques  are  required  to  communi¬ 
cate  with  one  another,  they  shall  do  so  using  only  the 
code  extension  techniques  they  have  in  common. 

2.1.4  Code  extension  techniques  are  classified,  and 
some  classes  are  given  a  structure  in  this  standard. 

Other  assignments  of  bit  combinations  associated  with 
the  designation  of  the  classes  will  be  made  in  accor¬ 
dance  with  the  procedures  for  the  registration  of  char¬ 
acters  and  character  sets  given  in  a  related  standard  cur¬ 
rently  under  development.  Specific  assignments  of  bit 
combinations  to  relate  individual  codes  with  their  invo¬ 
cation  or  designation  will  also  be  made  in  accordance 
with  that  standard. 

2.1.5  Code  extension  techniques  are  designed  to 
be  used  for  data  to  be  processed  serially  in  a  forward 
direction.  Use  of  these  techniques  in  strings  of  data  that 
are  processed  other  than  serially  in  a  forward  direction 
or  included  in  data  formatted  for  fixed  record  process¬ 
ing  may  have  undesirable  results. 
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2.2  Purpose 

2.2.1  American  National  Standard  Code  for  Infor¬ 
mation  Interchange  (ASCII),  X3.4-1968,  specifies  the 
representation  of  128  characters.  Additionally,  it  al¬ 
lows  the  representation  of  several  other  graphics  by  the 
combination  of  two  graphic  characters  with  the  Back¬ 
space  control  character.  In  many  instances,  ASCII  lacks 
either  controls  or  graphics  to  sufficiently  satisfy  the 
needs  of  an  application.  These  needs  may  be  satisfied 
by  means  of  code  extension,  which  is  the  subject  of 
this  standard. 

This  standard  is  intended  to  present  a  review  of  the 
salient  structure  of  ASCII  and  then  build  upon  that 
structure  to  describe  various  means  of  extending  the 
control  and  graphic  set  of  the  code.  It  also  describes 
structures  and  techniques  to  construct  or  formalize 
codes  related  to  ASCII.  These  related  codes  are  struc¬ 
tured  so  as  to  allow  application-dependent  usage  with¬ 
out  preventing  the  interchangeability  of  their  data. 

2.2.2  The  standard  describes  the  following: 

(1)  The  structure  of  ASCII 

(2)  Extension  of  the  7-bit  code,  remaining  in  a  7-bit 
environment,  and  making  use  of  code  extension  tech¬ 
niques 

(3)  Increasing  the  number  of  bits  to  8,  yet  retain¬ 
ing  a  structure  compatible  with  the  7-bit  structure 

(4)  Increasing  the  number  of  bits  to  8  and  applying 
similar  code  extension  techniques 

2.2.3  In  order  to  use  identical  techniques  in  each 
of  the  cases  mentioned  in  2.2.2,  and  to  facilitate  con¬ 
version  between  them,  standard  rules  are  necessary. 

This  has  the  advantage  of: 

(1)  Reducing  the  risk  of  conflict  between  systems 
required  to  interoperate 

(2)  Permitting  provision  for  code  extension  in  the 
design  of  systems 

(3)  Providing  standardized  methods  of  calling  into 
use  agreed-upon  sets  of  characters 

(4)  Allowing  the  interchange  of  data  between  7-bit 
and  8-bit  environments 

2.2.4  This  standard  also  describes  the  structure  of 
families  of  codes  that  are  related  to  ASCII  by  their 
structure. 

2.3  Application.  Characters  and  character  sequences 
conforming  to  this  standard  are  used  in  ASCII  or  ASCII- 
related  information  interchange  where  additional  con¬ 
trol  or  graphic  characters,  or  both,  not  in  ASCII  must 
be  provided. 

2.4  Conformance.  Recorded  or  transmitted  data  are  in 
conformance  with  this  standard  if  all  those  bit  patterns 
contained  in  the  data  stream  have  exactly,  and  only, 
the  meanings  specified  in  this  standard  and  in  a  related 
standard  currently  under  development. 


WARNING:  Products  may  provide  or  process  bit  patterns, 
code  extension  procedures,  classes  of  Escape  sequences,  and 
code  relationships  not  specified  in  this  standard.  Products  may 
have  the  capability  to  record  or  transmit  data  that  contain  con¬ 
tiguous  bits  or  bytes,  the  meaning  of  which  is  not  intended  as 
those  specified  in  this  standard. 

A  user  application  program  may  cause  nonconforming  bit 
patterns  to  be  transmitted  in  interchange  data  or  require  the 
use  of  bit  patterns,  code  extension  procedures,  classes  of  Es¬ 
cape  sequences,  and  code  relationships  not  as  specified  in  this 
standard  for  further  processing  of  the  application  data. 

The  use  of  such  product  features  or  user  applications  may 
render  the  data  in  nonconformance  with  this  standard. 


3.  Definitions 

In  this  standard,  the  following  definitions  shall  apply: 

bit  combination.  An  ordered  set  of  bits  that  represents 
a  character. 

byte.  A  bit  string  that  is  operated  upon  as  a  unit  and 
whose  size  is  independent  of  redundancy  or  framing 
techniques. 

character.  A  member  of  a  set  of  elements  that  is  used 
for  the  organization,  control,  or  representation  of  data. 

code,  coded  character  set.  A  set  of  unambiguous  rules 
that  establish  a  character  set  and  the  one-to-one  rela¬ 
tionships  between  the  characters  of  the  set  and  their 
bit  combinations. 

code  extension.  Techniques  for  the  encoding  of  charac¬ 
ters  that  are  not  included  in  the  character  set  of  a  given 
code. 

code  table.  A  table  showing  the  character  correspond¬ 
ing  to  each  bit  combination  in  a  code. 

control  character.  A  character  whose  occurrence  in  a 
particular  context  initiates,  modifies,  or  stops  a  control 
function. 

control  function.  An  action  that  affects  the  recording, 
processing,  transmission,  or  interpretation  of  data. 

designate.  To  identify  a  set  of  characters  that  are  to  be 
represented,  in  some  cases  immediately  and  in  others 
on  the  occurrence  of  a  further  control  function,  in  a 
prescribed  manner. 

environment.  The  characteristic  that  identifies  the  num¬ 
ber  of  bits  used  to  represent  a  character  in  a  data  pro¬ 
cessing  or  data  communication  system  or  in  part  of  such 
a  system. 

Escape  sequence.  A  bit  string  that  is  used  for  control 
purposes  in  code  extension  procedures  and  that  con¬ 
sists  of  two  or  more  bit  combinations,  of  which  the 
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first  is  the  bit  combination  corresponding  to  the  Escape 
character. 

Final  character.  The  character  whose  bit  combination 
terminates  an  Escape  sequence. 

graphic  character.  A  character,  other  than  a  control 
character,  that  has  a  visual  representation  normally 
handwritten,  printed,  or  displayed. 

Intermediate  character.  A  character  whose  bit  combi- 


b7  b6  b5  b4  b3  b2  b, 


The  bits  of  a  7-bit 
combination: 

The  bits  of  an  8-bit 

combination:  a8  a7  a6  a5  a4  a3  a2  a! 

Bit  weight  for  col¬ 
umn  and  row  Column  Row 

reference:  23  22  21  2°  23  22  21  2° 

4.3  Character  Notations.  Characters  may  be  repre¬ 
sented  as  follows: 


nation  occurs  between  the  Escape  character  and  the 

Mnemonics: 

ENQ 

ETB 

Final  character  in  an  Escape  sequence,  consisting  of 

Denotation: 

more  than  two  bit  combinations. 

(Location  in  code  table 

[column/row]  ): 

0/5 

1/7 

invoke.  To  cause  a  designated  set  of  characters  to  be 

Bit  pattern  (byte): 

0000101 

00101 1 

represented  by  the  prescribed  bit  combinations  when¬ 
ever  those  bit  combinations  occur,  until  an  appropriate 
code  extension  function  occurs. 

position.  An  item  in  a  code  table  identified  by  its  col¬ 
umn  and  row  coordinates. 


Character  mnemonics  such  as  SO  and  ESC,  and  column/ 
row  numbers  such  as  0/5  and  1/7,  are  as  shown  to  empha¬ 
size  the  fact  that  they  stand  for  one  bit  combination  only. 


represent.  To  use  a  prescribed  bit  combination  with  the 
meaning  of  a  character  in  a  set  of  characters  that  has 
been  designated  and  invoked.  To  use  an  Escape  sequence 
with  the  meaning  of  an  additional  control  character. 


4.  Notational  Conventions 

4.1  General.  The  primary  concept  of  code  extension  is 
to  allow  the  meaning  associated  with  a  bit  pattern  to  be 
changed  in  a  discrete  and  orderly  way.  As  a  result, 
some  shorthand  notations  are  introduced  to  disassociate 
bit  patterns  and  classes  of  bit  patterns  from  the  mean¬ 
ings  in  any  one  code  set.  A  number  of  notational  con¬ 
ventions  have  been  established  in  this  document  to  sim¬ 
plify  the  expressions  for  code  extension. 

A  bit  combination  is  sometimes  referred  to  by  the 
column  and  row  numbers  of  its  position  in  the  code 
table.  The  column  number  is  the  sum  of  the  decimal 
equivalent  of  bits  b7-b5  (or  a8-a5 ),  and  the  row  num¬ 
ber  is  the  sum  of  the  decimal  equivalent  of  bits  b4-b, 

(or  a4-aj ),  giving  to  these  bits  the  weights  given  in 
4.2. 

In  representing  the  decimal  equivalents,  the  conven¬ 
tion  is  to  append  a  leading  zero  to  the  column  number 
for  8-bit  columns  00-09.  As  an  example,  the  position 
of  the  Space  character  in  the  7-bit  code  table  is  2/0; 
the  position  of  the  same  character  in  an  8-bit  code 
table  is  02/0. 

4.2  Character  Representation.  In  this  standard  the  fol¬ 
lowing  notations  are  used: 


5.  Extension  of  the  7-Bit  Code  Remaining  in  a 
7-Bit  Environment 

5.1  Introduction 

5.1.1  The  Structure  of  the  7-Bit  Code.  The  7-bit 
code  table  of  ASCII  consists  of  an  ordered  set  of  con¬ 
trols  and  graphic  characters  grouped  (see  Fig.  1)  as 
follows: 

(1)  The  set  of  thirty-two  control  characters  allo¬ 
cated  to  columns  0  and  1  (hereafter  referred  to  as  the 
CO  set) 

(2)  The  Space  character  in  position  2/0,  which  may 
be  regarded  either  as  a  control  character  or  a  nonprint¬ 
ing  graphic  character 

(3)  The  set  of  ninety-four  graphic  characters  allocat¬ 
ed  to  columns  2-7  (hereafter  referred  to  as  the  GO  set) 

(4)  The  Delete  character  in  position  7/15 


Ol  2  3  4  5  6  7 


The  Structure  of  the  7-Bit  Code 
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5.1.2  Extension  by  Substitution.  In  many  cases  the 
provisions  of  ASCII  will  satisfy  the  requirements  of  an 
application.  Other  applications,  however,  will  be  satis¬ 
fied  by  the  use  of  a  similarly  structured  code  in  which 
some  of  the  characters  of  ASCII  are  substituted  by 
other  characters.  Such  a  substitution  may  be  regarded 
as  a  replacement  of  the  CO  set  or  the  GO  set,  or  both, 
accordingly  as  new  controls  or  graphics,  or  both,  are 
required  (see  Appendix  B  of  American  National  Stan¬ 
dard  X3.4-1968). 

5.1.3  Extension  by  Increasing  the  Repertory  of 
Characters.  This  standard  provides  for  the  addition 
of  characters  to  the  1  28  provided  by  the  structure 
of  the  7-bit  code  in  the  following  ways: 

(1)  Additional  single  controls 

(2)  Additional  sets  of  thirty-two  control  characters 
(referred  to  as  Cl  sets) 

(3)  Additional  sets  of  ninety-four  graphic  characters 
(referred  to  as  G1  sets) 

(4)  Sets  of  more  than  ninety-four  graphic  characters, 
each  represented  by  more  than  one  byte  (these  sets 
function  as  GO  sets) 

Many  applications  will  require  combinations  of  the 
aforementioned  facilities.  These  facilities  are  shown 
in  Fig.  2. 

5.1.4  Code  Extension  Characters.  In  the  7 -bit  code, 
the  following  characters  are  provided  for  the  purpose 
of  code  extension: 

Escape  character:  ESC 

Shift-Out  character:  SO 

Shift-In  character:  SI 

Data  Link  Escape  character:  DLE 

This  standard  does  not  describe  the  use  of  the  Data 
Link  Escape  character,  which  is  reserved  for  the  provi¬ 
sion  of  additional  transmission  controls;  the  use  of  this 
character  is  specified  in  American  National  Standard 


OTHER  ADDITIONAL 
SINGLE  CONTROLS 


r 


Fig.  2 

Structure  for  the  Extended  7-Bit  Code 


Procedures  for  the  Use  of  the  Communication  Control 
Characters  of  American  National  Standard  Code  for 
Information  Interchange  in  Specified  Data  Communica¬ 
tion  Links,  X3. 28-1971 . 

5.1.5  Compatibility.  For  purposes  of  interchange, 
various  levels  of  compatibility  that  may  be  preserved 
among  extension  facilities  are  identified  as  follows: 

(1)  A  set  that  is  compatible  with  ASCII  inasmuch  as: 

(a)  Columns  0  and  1  contain  only  control  char¬ 
acters 

(b)  Columns  2-7  are  used  for  graphic  characters 
only  (apart  from  DEL) 

(c)  The  ten  transmission  control  characters  and 
NUL,  SO,  SI,  CAN,  SUB,  ESC,  and  DEL  remain  unal¬ 
tered  in  their  meanings  and  in  their  positions  in  the 
code  table 

(d)  Graphics  of  ASCII  are  not  moved  to  other 
positions  (A  non-Latin  alphabet  containing  graphics 
that  are  also  included  in  the  Latin  alphabet  is  not  sub¬ 
ject  to  this  rule.) 

(2)  Other  sets  structured  as  in  5.1.1 

To  be  able  to  provide  the  facilities  of  code  exten¬ 
sion  given  in  this  standard,  the  Escape,  or  the  Shift-Out 
and  Shift-In  characters,  or  both,  shall  remain  unaltered 
in  their  meanings  and  their  positions  in  the  code  table. 

5.2  Extension  of  the  Graphic  Set  by  Means  of  the 
Shift-Out  and  Shift-In  Characters 

5.2.1  Use  of  SO  and  SI.  The  Shift-Out  character 
SO  and  the  Shift-In  character  SI  are  used  exclusively 
for  extension  of  the  graphics. 

The  character  SO  invokes  an  alternative  set  of  ninety- 
four  graphics,  the  G1  set.  This  set  replaces  the  graphic 
characters  of  the  GO  set.  Graphic  characters  need  not  be 
assigned  to  all  the  positions  of  the  alternative  set,  nor 
need  all  the  graphic  characters  of  the  alternative  set  be 
different  from  the  graphic  characters  of  the  GO  set. 

The  character  SI  invokes  the  graphic  characters  of 
the  GO  set  that  are  to  replace  the  graphic  characters  of 
the  alternative  set. 

The  meanings  ol  the  following  bit  combinations  are 
not  affected  by  the  occurrence  of  SO  and  SI: 

( 1  )  Those  corresponding  to  the  control  characters 
in  columns  0  and  1  and  the  Delete  character  in  position 
7/15 

(2)  The  one  corresponding  to  the  Space  character  in 
position  2/0 

(3)  Those  included  in  any  Escape  sequence 

The  Space  character  occurs  only  at  position  2/0;  it 
shall  not  be  assigned  to  any  position  in  the  alternative 
graphic  set.  These  provisions  do  not  preclude  the  assign¬ 
ment  to  positions  in  any  graphic  set  of  characters  equiva¬ 
lent  to  spaces  ot  size  other  than  that  of  the  Space  as¬ 
signed  to  position  2/0,  as,  for  example,  half-space. 
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GO  SET  I 

GO  SET  2 
GO  SET  3 
GO  SET  4 


GO  SET  n 

Gl  SET  I 
G  I  SET  2 
Gl  SET  3 
Gl  SET  4 


Gl  SET  n 


Fig.  3 

Schematic  Illustration  of  the  Shift-Out  and  Shift-In  Concept 


At  the  beginning  of  any  information  interchange,  the 
shift  status  should  be  defined  by  SI  or  SO.  When  in  the 
Shift-In  state,  SI  has  no  effect;  and  when  in  the  Shift- 
Out  state,  SO  has  no  effect. 

5.2.2  Unique  Shift-Out  Set.  Some  applications  re¬ 
quire  the  use  of  only  one  alternative  set  of  ninety-four 
graphic  characters.  In  such  a  case,  that  unique  set  is  in¬ 
voked  by  each  use  of  SO.  The  set  is  identified  either  by 
an  appropriate  ESC  sequence  as  described  in  5.3.7  or  by 
agreement  between  or  among  the  interchanging  parties. 

5.2.3  Multiple  Shift-Out  Sets.  If  two  or  more  alterna¬ 
tive  graphic  sets  are  required  to  coexist  in  a  system,  the 
set  to  be  used  next  is  designated  by  the  appropriate  ESC 
sequence.  That  set  can  then  be  invoked  by  the  use  of  SO. 

The  use  of  SI  reinvokes  the  graphics  of  the  GO  set 
last  designated,  but  does  not  affect  the  identity  of  the 
designated  alternative  set.  The  alternative  set  may  be  in¬ 
voked  any  number  of  times  by  successive  use  of  SO  un¬ 
til  it  is  superseded  by  another  alternative  set  designated 
by  another  Escape  sequence. 

It  is  not  necessary  to  revert  to  the  GO  set  by  use  of 
SI  before  changing  from  one  alternative  set  to  another 
by  means  of  a  further  Escape  sequence.  When  the  sys¬ 
tem  is  in  the  Shift-Out  state,  the  use  of  such  a  further 
Escape  sequence  leaves  the  shift  status  unaltered,  and 
the  alternative  set  is  invoked. 

A  schematic  representation  of  the  aforementioned 
is  given  in  Fig.  3. 

WARNING:  In  some  devices  or  systems  there  may  be  a  require¬ 
ment  to  reestablish  the  Shit t-ln  state  before  designating  a  new 
Shift-Out  set  by  means  of  an  Escape  sequence.  This  can  be 
achieved  by  inserting  SI  before  the  Escape  sequence  which  des¬ 
ignates  the  subsequent  Shift-Out  set.  Such  a  requirement  shall 
be  agreed  upon  between  or  among  any  interchanging  parties. 


5.3  Code  Extension  by  Means  of  Escape  Sequences 

5.3.1  Purposes  of  Escape  Sequences.  Escape  se¬ 
quences  provide  single  or  sets  of  control  functions 
other  than  for  transmission  control.  Escape  sequences 
are  also  used  to  designate  sets  of  graphics,  different  uses 
of  some  or  all  of  the  7-bit  code  combinations,  and  coded 
character  sets  with  a  number  of  bits  other  than  seven. 

Thus,  Escape  sequences  are  required  to  provide,  for 
example: 

( 1 )  A  single  control  character  not  already  in  the 
code 

(2)  A  set  of  control  characters  not  already  in  the 
code 

(3)  A  set  of  graphic  characters  not  already  in  the 
code 

(4)  A  code  of  different  structure 

5.3.2  Structure  of  Escape  Sequences.  An  Escape 
sequence  consists  of  two  or  more  7-bit  combinations. 
The  first  is  always  the  bit  combination  of  ESC  and  the 
last  is  always  one  of  the  Final  characters.  An  Escape 
sequence  may  also  contain  any  number  of  7-bit  com¬ 
binations  representing  Intermediate  characters. 

The  meaning  of  an  Escape  sequence  is  determined 
by  the  7-bit  combination  representing  its  Intermediate 
character(s),  if  any,  and  by  the  7-bit  combination  rep¬ 
resenting  its  Final  character. 

W'ARNING:  Although  in  this  standard  Escape  sequences  are 
described  in  terms  of  characters  or  of  positions  in  the  code 
tables,  the  meaning  of  an  Escape  sequence  is  determined  only 
by  its  bit  combinations,  and  it  is  unaffected  by  any  meaning 
previously  assigned  to  these  bit  combinations  taken  individ¬ 
ually. 

Intermediate  characters  are  the  sixteen  characters  of 
column  2  of  the  7 -bit  code  table. 
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NOTE:  In  this  standard,  any  one  of  these  sixteen  Intermediate 
characters  is  denoted  by  the  symbol  "I.” 

Final  characters  are  the  seventy-nine  characters  of 
columns  3-7  of  the  7-bit  code  table  excluding  position 
7/15. 

NOTE:  In  this  standard,  any  one  of  these  seventy-nine  Pinal 
characters  is  denoted  by  the  symbol  “F.” 

Prohibited  characters  are  the  control  characters  in 
columns  0  and  1  and  the  character  in  position  7/15. 

The  thirty-three  prohibited  characters  shall  not  be 
used  as  either  Intermediate  or  Final  characters  to  con¬ 
struct  an  Escape  sequence. 

As  these  prohibited  characters  may  appear  in  an 
ESC  sequence  in  error,  it  may  be  necessary  within  an 
application  to  provide  methods  of  identifying  such  a 
situation  and  of  recovering  from  it,  but  this  is  not  cov¬ 
ered  in  this  standard  (see  Section  A1  of  Appendix  A). 

5.3.3  Categories  of  Escape  Sequences.  The  cate¬ 
gories  of  Escape  sequences  are  specified  in  this  stan¬ 
dard.  However,  Escape  sequences  with  Final  characters 
from  column  3  are  reserved  for  private  use  subject  to 
the  categorization  outlined  in  5.3.3. 1  and  5. 3. 3. 2. 

WARNING:  The  implementors  of  any  private  Escape  se¬ 
quence  described  as  such  in  this  document  are  alerted  to  the 
fact  that  other  implementors  may  give  different  meanings  to 
the  same  Escape  sequence  or  may  use  different  Escape  sequences 
to  mean  the  same  thing.  Furthermore,  such  meanings  may  sub¬ 
sequently  be  assigned  to  standardized  Escape  sequences.  Inter¬ 
changing  parties  are  warned,  therefore,  that  such  agreement  may 
reduce  their  capability  to  interchange  subsequently. 

5.3.3. 1  Two-Character  Escape  Sequences.  A  two- 
character  Escape  sequence  takes  the  form  ESC  F.  Such 
Escape  sequences  are  used  to  represent  single  additional 
characters.  The  seventy-nine  two-character  Escape  se¬ 
quences  are  split  into  three  types,  depending  on  the 
Final  character,  as  shown  in  Fig.  4. 

An  ESC  Fs  sequence  represents,  depending  on  the 
Final  character  used,  a  single  additional  standardized 
control  character.  Thirty -one  Final  characters  of  col¬ 
umns  6  and  7  are  provided  for  this  purpose. 

An  ESC  Fe  sequence  represents,  depending  on  the 
Final  character  used,  an  individual  control  character  of 
an  additional  standardized  set  of  thirty-two  control 
characters  (see  5.3.6).  The  thirty-two  Final  characters 
of  columns  4  and  5  are  provided  for  this  purpose.  Some 
applications  require  the  use  of  only  one  such  additional 
set.  In  this  case,  the  set  is  identified  either  by  the  appro¬ 
priate  ESC  sequence,  as  described  in  5.3.6,  or  by  agree¬ 
ment  between  or  among  the  interchanging  parties.  If 
more  than  one  additional  set  of  controls  are  required 
to  coexist  in  a  system,  the  set  to  be  used  next  is  desig¬ 
nated  and  invoked  by  the  appropriate  ESC  sequence. 

An  ESC  Fp  sequence  represents,  depending  on  the 
Final  character  used,  a  single  additional  character  with¬ 


out  standardized  meaning,  for  private  use  as  required, 
subject  to  the  prior  agreement  of  the  sender  and  the 
recipient  of  the  data.  The  sixteen  Final  characters  of 
column  3  are  provided  for  this  purpose. 

5. 3. 3. 2  Three-Character  Escape  Sequences.  A 
three-character  Escape  sequence  takes  the  form  ESC  I  F. 

All  types  of  three-character  Escape  sequences  are 
grouped  into  categories  according  to  their  purpose,  by 
means  of  their  Intermediate  characters,  as  shown  in 
5.3.4  through  5.3.10  (see  Table  1 ). 

These  sequences  are  split  into  two  types  according 
to  their  Final  character  as  shown  in  Fig.  5. 

ESC  1  Ft  sequences  are  used  for  standardized  pur¬ 
poses.  The  sixty-three  Ft  characters  of  columns  4-7 
are  provided  for  this  purpose. 


0 

1 

2 

3 

4  5 

6  7 

0 

Fp 

Fe 

Fs 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

1  1 

12 

13 

14 

15 

Fig.  4 

Portions  of  the  Code  Table  Used  in  Two-Character 
Escape  Sequences 


0 

1 

2 

3 

4  |  5  6  7 

6 

l 

Fp 

Ft 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

1 1 

12 

13 

14 

15 

1 

Fig.  5 

Portions  of  the  Code  Table  Used  in 
Three-Character  Escape  Sequences 
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ESC  I  Fp  sequences  are  reserved  for  private  use.  The 
sixteen  Fp  characters  of  column  3  are  provided  for  this 
purpose. 

5.3.4  Single  Additional  Characters.  ESC  2/3  F  rep¬ 
resents,  depending  on  the  Final  character  used,  a  single 
additional  character. 

5.3.5  Sets  of  Thirty-Two  Control  Characters  for 
Columns  0  and  1 .  ESC  2/ 1  F  designates  and  invokes 
the  CO  set  of  thirty-two  control  characters  for  represen¬ 
tation  by  the  bit  combinations  of  columns  0  and  1. 

The  ten  transmission  control  characters,  when  in¬ 
cluded  in  a  CO  set,  shall  retain  their  meanings  and  their 
positions  in  the  code  table.  No  other  transmission  con¬ 
trol  characters  shall  be  included  in  a  CO  set. 

To  reduce  the  risk  of  conflict  in  the  interchange  of 
data,  this  set  should  have  the  following  characteristics: 

(1)  Inclusion  of  the  ten  transmission  control  char¬ 
acters 

(2)  Inclusion  of  the  characters  NUF,  SO,  SI,  CAN, 
SUB,  and  ESC  with  their  meanings  and  their  position 
in  the  code  table  unaltered 

Consideration  should  be  given  to  the  effect  that 
changing  the  meaning  of  control  characters  can  have  on 
equipment  when  interchanging  data.  For  example,  the 
bit  combination  corresponding  to  HT  will  have  the  ef¬ 
fect  of  “horizontal  tabulation”  to  a  system  designed  to 
respond  to  this  control  character. 

5.3.6  Sets  of  Thirty-Two  Control  Characters  for 
Representation  by  ESC  Fe.  ESC  2/2  F  designates  and 
invokes  the  Cl  set  of  thirty -two  control  characters  with¬ 
out  affecting  the  CO  set.  Individual  control  characters 
of  such  a  set  are  represented  by  means  of  ESC  Fe  se¬ 
quences  rather  than  single-character  bit  combinations.  A 
Cl  set  shall  not  include  transmission  control  characters. 

5.3.7  Sets  of  Ninety-Four  Graphic  Characters. 

ESC  2/8  F  and  ESC  2/12  F  designate  sets  of  ninety- 
four  graphic  characters  that  will  be  used  as  the  GO  set. 
The  designated  set  is  invoked  by  SI. 

ESC  2/9  F  and  ESC  2/13  F  designate  sets  of  ninety- 
four  graphic  characters  that  will  be  used  as  the  G1  set. 
The  designated  set  is  invoked  by  SO. 

The  two  aforementioned  groups  of  graphic  sets  to¬ 
gether  make  up  a  single  repertory  of  graphic  sets  which 
may  be  designated  to  either  of  two  available  positions, 
GO  or  G1 .  No  significance  is  attached  to  the  groupings 
other  than  that  their  existence  allows  more  such  sets 
of  ninety-four  graphic  characters  to  be  defined  within 
the  scope  of  threc-character  Escape  sequences.  There 
are,  therefore,  126  such  sets  possible  for  standardiza¬ 
tion  without  requiring  further  extension  (see  5.3.12). 

5.3.8  Codes  That  Require  Special  Interpretation. 
ESC  2/5  F  designates  and  invokes  a  code  that  requires 
special  interpretation,  such  as: 

(1)  A  code  with  a  number  of  bits  other  than  seven, 


excluding  those  8-bit  codes  structured  in  accordance 
with  this  standard 

(2)  A  7-bit  code  whose  characteristics  differ  from 
those  in  this  standard 

The  Final  character  assignments  are  such  that,  within 
the  Ft  and  Fp  groups  (columns  3-7),  the  following  clas¬ 
sification  occurs: 

Final  in 

Column  Broad  Categorization 

3  Private  code  with  any  number  of  bits 

4  Code  of  less  than  7  bits 

5  Code  of  7  bits 

6  Code  of  8  bits 

7  Code  of  more  than  8  bits 

5.3.9  Sets  of  Graphics  with  Multiple-Byte  Representa¬ 
tion.  ESC  2/4  F  designates  sets  of  graphic  characters  that 
are  represented  by  two  or  more  bytes,  each  correspond¬ 
ing  to  a  bit  combination  in  columns  2-7,  apart  from 
positions  2/0  and  7/15.  The  designated  set  is  invoked  by 
SI  and  is  therefore  regarded  as  a  GO  set.  Within  such  a 
set,  each  graphic  character  is  represented  by  the  same 
number  of  bytes  as  shown  in  Fig  6. 

5.3.10  Announcement  of  Extension  Facilities. 

ESC  2/0  F  announces  the  extension  facilities  used  in 
conjunction  with  data  that  follow.  The  use  of  these  se¬ 
quences  is  specified  in  Section  8. 

5.3.1 1  Three-Character  Escape  Sequences  without 
Assigned  Meanings.  The  three-character  Escape  se¬ 
quences  ESC  2/6  F,  ESC  2/7  F,  ESC  2/10  F,  ESC  2/1 1  F, 
ESC  2/14  F,  and  ESC  2/15  F  have  not  been  assigned 
meanings  and  are  reserved  for  future  standardization. 

5.3.12  Escape  Sequences  Having  Four  or  More  Char¬ 
acters.  Escape  sequences  having  four  or  more  characters 
shall  be  interpreted  according  to  the  following: 

(1)  The  first  Intermediate  character  shall  indicate 
the  category  of  usage  identical  with  three-character 
Escape  sequences  in  5.3.1 1 


Fig.  6 

The  Multiple-Byte  GO  Set 
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Table  1 

Summary  of  Assignments  of  Three-Character  Escape  Sequences 


Column/ 

Row 

Bits  of  Intermediate 
b7  b6  .  .  .  .  b. 

Category 

Grouping 

Subsection 

2/0 

0100000 

Announcers 

5.3.10 

2/1 

0100001 

CO  set 

5.3.5 

2/2 

0100010 

CONTROLS 

Cl  set 

5.3.6 

2/3 

0  1  0  0  0  1  1 

Single  characters 

5.3.4 

2/4 

0100100 

GRAPHICS 

Multiple-byte  sets 

5.3.9 

2/5 

0100101 

Codes  requiring  special  interpretation 

5.3.8 

2/6 

0100110 

2/7 

0100111 

Reserved  for  future  standardization 

5.3.11 

2/8 

0101000 

GO  set 

2/9 

0101001 

Gl  set 

5.3.7 

2/10 

0101010 

Reserved  for  future  standardization 

5.3.1 1 

2/11 

0  10  10  11 

GRAPHICS* 

2/12 

0  10  1  10  0 

GO  set 

2/13 

0101101 

Gl  set 

5.3.7 

2/14 

0101110 

5.3.11 

2/15 

0101111 

Reserved  for  tuture  standardization 

♦There  is  a  single  repertory  of  sets  of  ninety-four  graphic  characters.  Any  member  of  the  repertory  may  be  designated  as  either  a  GO 
or  G1  set.  Four  designating  Escape  sequences,  two  for  GO  and  two  for  Gl,  are  provided  for  designating  members  of  the  repertory. 


(2)  The  second  and  any  additional  Intermediate 
characters  shall  be  associated  with  the  Final  character 
to  permit  additional  entities  within  the  category  de¬ 
fined  by  the  first  Intermediate  character 

(3)  All  Escape  sequences  having  four  or  more  char¬ 
acters  whose  final  character  is  of  the  Ft  type  are  re¬ 
served  for  standardization 

(4)  All  Escape  sequences  whose  final  character  is  of 
the  Fp  type  (private)  are  not  to  be  the  subject  of  stan¬ 
dardization 

5.3.13  Omission  of  Escape  Sequences.  If  the  inter¬ 
changing  parties  have  agreed  upon  a  single  GO  set,  a 


single  Gl  set,  a  single  CO  set,  and  a  single  C 1  set  (or  on 
as  many  of  these  sets  as  are  to  be  used),  they  may  also 
agree  to  omit  the  use  of  Escape  sequences  to  designate 
or  invoke  them.  Interchanging  parties  are  warned,  how¬ 
ever,  that  such  agreements  may  reduce  their  capability 
to  interchange  data  subsequently. 

5.4  Pictorial  and  Tabular  Representations.  Fig.  7  sum¬ 
marizes  in  pictorial  form  the  standard  means  of  code 
extension  within  a  7-bit  environment. 

Table  1  summarizes  in  tabular  form  the  assignment 
of  Intermediate  characters  in  Escape  sequences. 
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Fig.  8 

The  8-Bit  Code  Table 


6.  Structure  of  a  Family  of  8-Bit  Codes 

6.1  General.  The  family  of  8-bit  codes  specified  in  this 
standard  is  obtained  by  the  addition  of  one  bit  to  each 
of  the  bit  combinations  of  the  7-bit  code,  thus  producing 
a  set  of  256  8-bit  combinations.  The  128  characters  of 
the  7-bit  code,  the  set  as  defined  under  5.1,  form  a 
defined  and  integral  part  of  an  8-bit  code  that  is  struc¬ 
tured  in  accordance  with  this  document.  The  128  addi¬ 
tional  bit  combinations,  whose  8th  bit  is  1,  are  available 
for  future  assignment. 

6.2  The  8-Bit  Code  Table.  A  16  X  16  array  of  columns 
numbered  00-15  and  rows  numbered  0-15  contains 
256  code  positions  (see  Fig.  8).  The  8-bit  code  table 
consists  of  an  ordered  set  of  controls  and  graphic  char¬ 
acters  grouped  as  follows  (see  Fig.  8): 

(1)  A  set  of  thirty-two  control  characters  allocated 
to  columns  00  and  01  (CO  set) 

(2)  A  set  of  ninety-four  graphic  characters  allocated 
to  columns  02-07,  subject  to  the  exception  of  posi¬ 
tions  02/0  and  07/15  (GO  set) 

(3)  The  Space  character  in  position  02/0,  which 
may  be  regarded  either  as  a  control  character  or  a  non¬ 
printing  graphic  character 

(4)  A  set  of  thirty-two  control  characters  allocated 
to  columns  08  and  09  (Cl  set) 

(5)  A  set  of  ninety-four  graphic  characters  allocated 
to  columns  10-15,  subject  to  the  exception  of  posi¬ 
tions  10/0  and  15/15 

The  control  characters  in  columns  08  and  09  of  an 
8-bit  code  shall  not  include  transmission  control  char¬ 


acters.  Provision  of  data  transmission  capability  for  8- 
bit  codes  includes  the  use  of  the  Data  Link  Escape  char¬ 
acter  and  is  covered  in  American  National  Standard 
Procedures  for  the  Use  of  the  Communication  Control 
Characters  of  American  National  Standard  Code  for 
Information  Interchange  in  Specified  Data  Communi¬ 
cation  Links,  X3.28-1971. 

6.3  The  Family  Concept.  In  order  to  cope  with  the 
different  needs  of  the  various  industries,  fields  of  appli¬ 
cation,  or  systems,  this  standard  defines  the  concept  of 
a  family  of  8-bit  codes  as  follows: 

(1)  A  set  of  thirty-two  additional  control  characters 
can  be  selected  for  columns  08  and  09 

(2)  A  set  of  ninety-four  additional  graphic  charac¬ 
ters  can  be  selected  for  columns  10-15  (excluding 
positions  10/0  and  15/15) 

There  are  standard  techniques  for  identifying  selec¬ 
tions  of  sets  of  controls  and  graphics  for  8-bit  codes. 
These  techniques  are  described  in  Sections  7,  8,  and  9. 

7.  The  Use  of  Code  Extension  in  an  8-Bit  Code 

7.1  General.  The  techniques  of  extending  an  8-bit  code 
described  in  this  standard  have  been  purposely  made 
compatible  with  those  used  to  extend  the  7-bit  code. 

The  Escape  character  is  used  in  an  8-bit  code  in 
exactly  the  same  way  as  in  the  7-bit  code  to  construct 
ESC  sequences.  Except  as  provided  in  7.2,  the  meanings 
of  these  sequences  are  not  altered  in  an  8-bit  code.  All 
characters  in  columns  08-15  are  excluded  from  assign- 
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ment  in  Escape  sequences  and  any  occurrences  of  them 
in  an  Escape  sequence  are  error  conditions  for  which 
no  standard  recovery  procedures  are  prescribed  in  this 
standard. 

7.2  Defining  an  8-Bit  Code.  As  described  in  Section  6, 
the  code  table  can  be  considered  as  having  four  main 
parts:  the  CO  control  set,  the  GO  graphic  set,  the  Cl 
control  set,  and  the  G1  graphic  set. 

The  remainder  of  the  code  table  consists  of  positions 
02/0  SP,  07/15  DEL,  10/0,  and  15/15. 

The  CO  and  GO  sets  are  designated  and  the  CO  set  is 
invoked  as  in  the  7-bit  environment  (see  5.3.5  and 
5.3.7). 

The  Cl  set  of  control  characters  is  designated  and 
invoked  by  means  of  an  Escape  sequence  as  in  the  7-bit 
environment  (see  5.3.6).  These  control  characters  are 
represented  by  the  bit  combinations  of  columns  08  and 
09. 

The  G1  set  of  graphic  characters  is  designated  by 
means  of  an  Escape  sequence  as  in  the  7-bit  environ¬ 
ment  (see  5.3.7).  These  graphic  characters  are  repre¬ 
sented  by  the  bit  combinations  of  columns  10-15. 

7.3  Code  Extension  by  Means  of  Escape  Sequences. 
Once  the  8-bit  code  is  established  in  accordance  with 
7.2,  code  extension  means  are  available,  making  use  of 
Escape  sequences  as  described  therein  and  in  7.3.1 
through  7.3.3. 

7.3.1  Two-Character  Escape  Sequences.  Two- 
character  Escape  sequences  have  the  same  structure 
as  in  the  7-bit  environment  (see  5.3.2). 

7. 3. 1.1  ESC  Fs  sequences  represent  single  addi¬ 
tional  controls  with  the  same  meaning  they  have  in  the 
7-bit  environment. 

7.3. 1.2  The  use  of  ESC  Fe  sequence  in  an  8-bit 
environment  is  contrary  to  the  intention  of  this  stan¬ 
dard  but,  should  they  occur,  their  meaning  is  the  same 
as  in  the  7-bit  environment. 

7.3.2  Three-Character  Escape  Sequences.  Three- 
character  Escape  sequences  have  the  same  structure  as 
in  the  7-bit  environment  (see  5.3.2). 

7.3.3  Escape  Sequences  Having  Four  or  More  Char¬ 
acters.  These  sequences  have  the  same  structure  and 
meaning  as  in  the  7-bit  environment  (see  5.3.2  and 
5.4). 

7.4  Pictorial  Representation.  Fig.  9  summarizes  in 
pictorial  form  the  standard  means  of  extension  avail¬ 
able  in  an  8-bit  environment. 


8.  Announcement  of  Extension  Facilities  Used 

The  class  of  three-character  Escape  sequences  ESC  2/0  F 


is  used  in  data  interchange  to  announce  the  code  exten¬ 
sion  facilities  utilized  in  the  data  that  follow.  Subject  to 
agreement  between  or  among  the  interchanging  parties, 
such  an  announcing  sequence  may  be  omitted.  The 
Final  character  of  the  announcing  sequence  indicates 
the  facilities  used  for  representing  graphic  sets  in  7-  and 
8-bit  environments  and  the  number  of  bits  used  as 
given  in  Table  2. 


9.  Relationship  between  7-Bit  and  8-Bit  Codes 

9.1  Transformation  between  7-Bit  and  8-Bit  Codes. 

Transformation  between  7-bit  and  8-bit  codes  depends 
on  which  facilities  of  code  extension  are  included  in 
the  application.  Examples  of  such  transformations  are 
given  in  Fig.  B7  and  B8  in  flowchart  form. 

9.2  Representation  of  the  7-Bit  Code  in  an  8-Bit  En¬ 
vironment.  It  may  sometimes  be  desirable,  as  for  exam¬ 
ple  in  a  store  and  forward  application,  to  retain  infor¬ 
mation  in  7-bit  form  while  in  an  8-bit  environment.  In 
this  case,  for  each  of  the  characters,  bits  b7-bi  are  rep¬ 
resented  by  bits  a7-ai ,  respectively,  and  a8  is  set  to 
zero. 

Table  2 


Announcer  Sequences  and  Meanings* 


Final 

Characters 

Facilities  Used 

4/1 

The  GO  set  only  is  to  be  used.  The  Escape  se¬ 
quence  which  designates  this  set  also  invokes  it 
into  columns  2-7.  SI  and  SO  are  not  to  be  used. 

In  an  8-bit  environment,  columns  10-15  are  not 
used. 

4/2 

The  GO  and  G1  sets  are  to  be  used.  In  both  7-  and 
8-bit  environments,  SI  invokes  GO  into  columns 
2-7  and  SO  invokes  G1  into  columns  2-7.  In  an 
8-bit  environment,  columns  10-15  are  not  used. 

4/3 

The  GO  and  G1  sets  are  to  be  used  in  an  8-bit  en¬ 
vironment  only .  The  designating  Escape  sequences 
also  invoke  the  GO  and  G1  sets  into  columns  2-7 
and  10-15,  respectively.  SI  and  SO  are  not  to  be 
used. 

4/4 

The  GO  and  G1  sets  are  to  be  used.  In  a  7-bit  en¬ 
vironment,  SI  invokes  GO  into  columns  2-7  and 
SO  invokes  G1  into  columns  2-7.  In  an  8-bit  envi¬ 
ronment,  the  designating  Escape  sequences  also 
invoke  the  GO  and  G1  sets  into  columns  2-7  and 
10-15,  respectively.  SI  and  SO  are  not  to  be  used. 

*A  pictorial  representation  of  these  cases  is  shown  in  Fig.  10. 


NOTE:  In  a  7-bit  environment,  data  announced  by  a  sequence 
ESC  2/0  4/4  have  the  same  form  as  data  announced  by  a  se¬ 
quence  ESC  2/0  4/2.  The  announcer  ESC  2/0  4/4  is  provided 
for  those  interchange  situations  in  which  it  is  agreed  to  differ¬ 
entiate  between  7-bit  and  8-bit  originated  data  in  the  7-bit 
environment. 


18 


AMERICAN  NATIONAL  STANDARD  X3. 41-1974 


F  INAL 

CHARACTER 


7-BIT 

ENVIRONMENT 


8-BIT 

ENVIRONMENT 


4/1 


4/3 


4/4 


NOT  USED 


DODD 


00 

DO  DODD 


SO 


Fig.  10 

Pictorial  Representation  of  the  Announcer  Facilities 
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Indication  that  true  8-bit  coded  data  follow  is 
achieved  by  the  occurrence  of  the  announcing  sequences 
ESC  2/0  4/3  or  ESC  2/0  4/4. 

Indication  that  7-bit  coded  data  follow  is  achieved 
by  the  occurrence  of  one  of  the  announcing  sequences 
ESC  2/0  4/1  or  ESC  2/0  4/2. 

9.3  Representation  of  Positions  10/0  and  15/15  in  a 
7-Bit  Environment.  No  meaning  is  assigned  to  positions 
10/0  and  15/15  in  this  standard.  If  there  is  a  require¬ 
ment  to  represent  these  positions  in  a  7-bit  environ¬ 
ment,  a  private  Escape  sequence  shall  be  used. 


10.  Specific  Meaning  of  Escape  Sequences 

The  meanings  of  individual  Escape  sequences  are  not 
specified  in  this  standard.  Instead,  their  meanings  will 
be  specified  using  the  procedures  for  the  registration 
of  characters  and  character  sets  given  in  a  related  stan¬ 
dard  currently  under  development  which  are  to  be  fol¬ 
lowed  in  preparing  and  maintaining  a  register  of  Escape 
sequences  and  their  meanings.  These  registration  pro¬ 
cedures  do  not  apply  to  Escape  sequences  reserved  for 
private  use. 
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Appendix  A 

Implementation  Considerations 


Al.  Appearance  of  Control  Characters  in  Escape  trols  may  be  interjected  into  Escape  sequences  as  a  part 
Sequences  of  the  communication  control  procedures. 


A  1.1  Although  this  standard  precludes  the  assignment 
of  control  characters  from  columns  0  and  1  in  Escape 
sequences,  it  does  not  necessarily  preclude  the  occur¬ 
rence  of  control  characters  in  Escape  sequences.  System 
designers  must  be  aware  that  additional  standards  may 
have  to  be  considered  if  the  occurrence  of  control  char¬ 
acters  is  to  be  taken  into  account.  Of  particular  concern 
is  American  National  Standard  Procedures  for  the  Use 
of  the  Communication  Control  Characters  of  American 
National  Standard  Code  for  Information  Interchange  in 
Specified  Data  Communication  Links,  X3.28- 1971.  Un¬ 
der  the  provisions  of  that  standaid,  communication  con- 


Table  Al 

Suggested  Validity  of  Communication 


Control  Characters  in  Sequences 


Communication  Control 
Character  or 

Character  Sequence 

May  Validly  Occur  in  an 
Escape  Sequence  at  a  Receiver 

SOU 

Yes 

STX 

Yes 

ETX 

No 

EOT 

No 

ENQ 

No 

ACK 

No 

DLE 

Yes 

NAK. 

No 

ETB 

Yes 

SYN 

Yes 

DLE  SOH 

Yes 

DLE  STX 

Yes 

DLE  ETX 

No 

DLE  EOT 

No 

DLE  ETB 

Yes 

DLE  = 

Yes 

DLE  0 

No 

DLE  1 

No 

DLE  2 

No 

DLE  3 

No 

DLE  4 

No 

DLE  5 

No 

DLE  6 

No 

DLE  7 

No 

DLE  SYN 

Yes 

A  1.2  In  interpreting  data  streams,  the  communication 
control  characters  and  communication  control  sequences 
formed  with  the  Data  Link  Escape  (DLE)  character  as 
well  as  any  associated  error  check  characters  (Block 
Check  characters  or  Cyclic  Redundancy  Check  charac¬ 
ters),  if  present,  should  not  be  allowed  to  affect  the 
meanings  of  Escape  sequences  in  which  they  occur.  The 
meaning  of  the  Escape  sequences  is  to  be  reckoned  as  if 
the  communication  controls  had  been  purged.  Of 
course,  this  doctrine  cannot  be  applied  when  the  occur¬ 
rence  of  a  communication  control  would  have  the  ef¬ 
fect  of  terminating  a  portion  of  a  message  prior  to  the 
transmission  of  the  Final  character  of  the  affected 
Escape  communication  control  characters  and  Data 
Link  Escape  sequences  and  the  possibility  of  their  oc¬ 
curring  with  validity  in  an  Escape  sequence  at  the  input 
to  the  receiving  portion  of  the  system.  The  occurrence 
of  a  communication  control  character  or  character  se¬ 
quence  formed  with  DLE  which  would  not  validly  oc¬ 
cur  in  the  data  stream  should  be  considered  as  an  error 
condition.  This  Appendix  does  not  prescribe  the  means 
of  recovery  from  this  error  condition.  For  suggested 
examples  of  the  preceding,  see  Table  AL 


A2.  interactions  When  Operating  in  Multipro¬ 
gramming  and  Multiprocessing  Environments 

The  requirements  and  restrictions  that  must  be  satisfied 
in  either  a  multiprogramming  or  multiprocessing  envi¬ 
ronment  are  not  increased  over  those  required  in  a  sim¬ 
ple  batch  environment  by  the  introduction  of  data  that 
include  Escape  sequences.  These  requirements  and  re¬ 
strictions  include  the  serial  examination  of  the  data  to 
ensure  that  all  Escape  sequences  are  interpreted,  and 
recovery  procedures  to  restart  after  error  take  into  ac¬ 
count  the  serial  dependency  of  the  data.  Also,  that  pro¬ 
vision  is  made  that  no  characters  in  Escape  sequences 
are  skipped  or  encountered  in  an  incorrect  sequence. 

The  last  requirement  precludes  access  being  per- 
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mitted  to  stored  data  at  any  point  where  the  Escape 
sequence  is  not  recognized,  or  to  magnetic  tape  using 
read  backward  if  the  Escape  sequences  were  only  pro¬ 
vided  for  read  forward. 

A3.  Unique  Interchange  by  Prior  Agreements 

In  order  that  the  code  represented  by  a  character  string 
can  be  unambiguously  identified,  the  character  string 
should  be  preceded  by  a  designating  Escape  sequence. 
However,  under  certain  conditions,  this  Escape  sequence 
can  be  omitted  as  illustrated  in  the  following  examples: 

(1)  If  only  ASCII  or  an  agreed-upon  8-bit  code  is 
used,  it  is  not  required  to  designate  these  codes. 

(2)  If  ASCII  or  an  ASCII  variant  is  used  with  a  sin¬ 
gle  alternative  graphic  set,  it  is  not  required  to  use  des¬ 
ignating  Escape  sequences. 

(3)  If  interchange  is  only  in  another  code,  such  in¬ 
terchange  is  beyond  the  scope  of  this  standard.  Any 
necessary  conventions  are  subject  to  the  agreement  of 
the  parties  to  the  interchange. 

However,  it  is  a  recommended  practice  that  the  des¬ 
ignating  Escape  sequence  be  included  in  every  inter¬ 
change  in  order  to  preserve  the  generality  of  the  appli¬ 
cation  and  avoid  unnecessary  limitations  to  the  arena 
of  interchangeability  of  the  data. 

A4.  System  Startup  and  Restart 

When  a  link  is  established  or  reinitialized,  in  the  ab¬ 
sence  of  any  other  agreement  between  or  among  the 
exchanging  parties,  the  basic  code  in  effect  is  assumed 
to  be  ASCII  (7-bit  environment).  If  the  interchange  is 
to  be  in  another  code  where  no  prior  agreement  be¬ 
tween  or  among  the  exchanging  parties  exists,  appro¬ 
priate  Escape  sequences  must  be  used. 

A5.  Future  Work  on  Code  Extension 

This  Appendix  contains  additional  code  extension  tech¬ 
niques  that  have  been  discussed.  This  is  one  approach 
to  the  problem  of  accommodating  more  than  two  pages 
of  graphics,  if  required  to  do  so. 


A6.  Additional  Invokers  and  Announcers 

A6.1  Definitions  of  Additional  Invokers 

A6.1.1  ESI.  ESI  is  a  mnemonic  for  the  control  char¬ 
acter  Extended  Shift-In.  This  control  is  used  in  8-bit 
extended  codes,  and  its  ESC  Fe  counterpart  is  used  in 
some  7-bit  extended  codes  to  invoke  the  last  designated 
G2  graphic  set. 

A6.1.2  ESO.  ESO  is  a  mnemonic  for  the  control 
character  Extended  Shift-Out.  This  control  is  used  in 
8-bit  extended  codes  and  its  ESC  Fe  counterpart  is 
used  in  some  7-bit  extended  codes  to  invoke  the  last 
designated  G3  graphic  set. 

A6.2  Categories.  These  categories  are  primarily  in¬ 
tended  for  8-bit  codes,  but  the  facility  for  their  use  is 
available  to  the  application  designer  or  user  in  7-bit  work¬ 
ing  where  full  understanding  of  the  implications  exists. 

ESC  2/10  F  and  ESC  2/14  F  designate  sets  of  ninety- 
four  graphic  characters  which  will  be  used  as  the  G2 
set.  The  designated  set  is  invoked  by  ESI  or  the  Escape 
sequence  ESC  4/15,  when  appropriate. 

A6.3  Future  Announcers.  See  Table  A2. 

Table  A 2 


Future  Announcers 


Final 

Character 

Facilities  Used 

4/5 

The  GO  set  of  graphics  is  to  be  used.  In  both  7- 
and  8-bit  environments,  the  Escape  sequence 
which  designates  this  set  also  invokes  it  into  col¬ 
umns  2-7.  SI  and  SO  are  not  to  be  used.  The  Cl 
set  of  controls  is  to  be  used  also.  In  an  8-bit  en¬ 
vironment,  individual  control  characters  of  the 

Cl  set  of  controls  will  be  represented  by  ESC  Fe 
sequences  and  not  by  the  bit  patterns  of  columns 
08  and  09.  Thus,  in  an  8-bit  environment,  col 
umns  08-15  are  not  used. 

4/6 

The  GO  and  G1  sets  of  graphics  are  to  be  used.  In 
both  7-  and  8-bit  environments,  SI  invokes  GO 
into  columns  2-7  and  SO  invokes  G 1  into  col¬ 
umns  2-7.  The  Cl  set  of  controls  is  to  be  used 
also.  In  an  8-bit  environment,  individual  control 
characters  of  the  Cl  set  of  controls  will  be  repre¬ 
sented  by  ESC  Fe  sequences  and  not  by  the  bit 
patterns  of  columns  08  and  09.  Thus,  in  an  8-bit 
environment,  columns  08-15  are  not  used. 

NOTE:  Additional  announcers  would  be  required  to  cover  G2 
and  G3  sets  of  graphics. 
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Illustration  of  Implementation 


Bl.  General 

The  implementation  of  the  extension  of  the  7-bit  code, 
described  in  this  standard,  depends  on  the  practical 
need  of  the  implementor  or  user  of  the  code  extension. 
Implementations  may  differ  in  the  following  aspects 
and  still  remain  in  accordance  with  this  standard: 

( 1 )  Including  all  or  parts  of  the  control  sets  and 
graphic  sets 

(2)  Identifying  the  sets  by  Escape  sequences,  or 
having  an  agreement 

(3)  Dropping  the  superfluous  invokers  or  not 

(4)  Examining  the  structure  of  an  Escape  sequence 
or  not 

(5)  Handling  the  character  DEL 

(6)  Combinations  of  the  aforementioned 

In  this  Appendix  there  are  shown  a  few  of  the  pos¬ 
sible  implementations  that  may  help  the  user  to  a  better 
understanding.  A  summary  of  Fig.  Bl  through  B8  is 
given  in  Table  Bl .  There  are  three  types  of  figures:  struc¬ 
ture,  interpretation,  and  transformation. 

Table  B2  shows  the  symbols  used  in  the  flowcharts 
(see  Fig.  B4  through  B8)  and  their  associated  meaning. 
Table  B3  shows  the  items  used  in  the  flowcharts,  their 
meaning,  and  the  possible  values. 

B2.  Structure 

The  pictorial  representations  of  the  structure  of  the  7- 
bit  and  8-bit  code  extension  (see  Fig.  Bl  through  B3) 


have  all  the  same  format  as  follows: 

( 1 )  At  the  top  of  the  figure  are  the  repertories  of 
single  controls  and  sets  of  controls 

(2)  Below  that  are  the  Escape  sequences  that  are 
used  for  the  designation  and  invocation  of  the  control 
sets  as  the  CO  and  Cl  set 

(3)  At  the  bottom  of  the  figure  is  the  repertory  of 
graphic  sets 

(4)  Above  that  are  the  Escape  sequences  used  for 
designation  of  the  graphic  set  as  the  GO  or  G1  set 

(5)  Above  that  are  the  shift  characters  that  are  used 
for  the  invocation  of  the  GO  or  G1  sets 

(6)  All  this  comes  together  in  the  center  to  form  the 
7-bit  set  or  8-bit  set  in  use 


B3.  Interpretation 

The  interpretation  of  data  containing  7-bit  or  8-bit  code 
extension  is  shown  by  means  of  flowcharts  (see  Fig.  B4 
through  B6).  The  interpretation  is  separate  from  the 
transformation. 


B4.  Transformation 

The  transformation  of  the  data  from  7-bit  to  8-bit  and 
from  8-bit  to  7-bit  is  shown  by  means  of  flowcharts  (see 
Fig.  B7  and  B8).  The  transformation  is  separate  from 
the  interpretation. 


Table  Bl 

Index  to  Figures  of  Appendix  B 

Figures 


Transformation 


Included  Sets 

Bits 

Structure 

Interpretation 

7  to  8 

8  to  7 

CO,  Cl,  GO,  G1 

7 

Bl 

B4 

B7 

B8 

CO,  Cl,  GO,  G1 

8 

B2 

B5 

- 

- 

CO,  Cl,  GO,  G1 

7  and  8 

B3 

B6 

- 

- 
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Table  B2 

Meaning  of  Flowchart  Symbols 


Table  B3 

Items  Used  in  the  Flowcharts 
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INVOCATION 
OF  GRAPHIC 
SETS 


DESIGNATION 
OF  GRAPHIC 
SETS 


MULTIPLE- 

BYTE 

GRAPHIC 

REPERTORY 


W 

SI  SO 


ESC  2/4  F  ESC  2/12  F  ESC  2/13  F 


GRAPHIC 

REPERTORY 


EXTENDED 

CONTROL 

REPERTORY 


Fig.  B1 

7-Bit  Code  Extension  Structure  Using  CO,  Cl,  GO,  and  G1  Sets 
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CONTROL 

REPERTORY 


DESIGNATION 
OF  GRAPHIC 
SETS 


MULTIPLE- 

BYTE 

GRAPHIC 

REPERTORY 


GRAPHIC 

REPERTORY 
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Fig.  B2 

8- Bit  Code  Extension  Structure  Using  CO,  C 1 ,  GO,  and  G 1  Sets 
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DESIGNATION 
OF  GRAPHIC 
SETS 


/ 


ESC  2/4  F 

/ 


V 

ESC  2/8  F 
ESC  2/12  F 


ESC  2/9  F 
ESC  2/13  F 


MULTIPLE- 
BYTE 
GRAPHIC 
RE  PERTORY 


GRAPHIC 
RE  PERTORY 


Fig.  B3 

Composite  7-Bit  and  8-Bit  Code  Extension  Structure 
Using  CO,  Cl,  GO,  and  G1  Sets 
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Fig.  B4 

Interpretation  of  7-Bit  Data  with  CO,  C 1 ,  GO,  and  G 1  Sets 
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Fig.  B5 

Interpretation  of  8-Bit  Data  with  CO,  C 1 ,  GO,  and  G 1  Sets 
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Fig.  B6 

Interpretation  of  7-Bit  and  8-Bit  Data  with  CO,  C 1 ,  GO,  and  G 1  Sets 
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Fig.  B7 

Transformation  from  7  Bits  to  8  Bits  Including  CO,  C 1 ,  GO.  and  G  1  Sets 
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Fig.  B8 

Transformation  from  8  Bits  to  7  Bits  Including  CO,  Cl ,  GO,  and  G1  Sets 
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